Weak-To-Strong Generalization
lesswrong.com·7h
Category Theory
Flag this post
Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.com·2d
🎯Reinforcement Learning
Flag this post
Get Ready for Clojure, GPU, and AI in 2026 with CUDA 13.0
dragan.rocks·2d·
Discuss: Hacker News
🦀Rust
Flag this post
Model welfare and open source
lesswrong.com·7h
Incremental Computation
Flag this post
Economics and Transformative AI (by Tom Cunningham)
lesswrong.com·11h
🔍AI Interpretability
Flag this post
Evidence on language model consciousness
lesswrong.com·1d
🔍AI Interpretability
Flag this post
Agentic AI and Security
martinfowler.com·4d·
🚀MLOps
Flag this post
Reflections on 4 years of meta-honesty
lesswrong.com·4h
📮Message Queues
Flag this post
An intro to the Tensor Economics blog
lesswrong.com·3d
🔢Homomorphic Encryption
Flag this post
Clojure Runs ONNX AI Models Now - Join the AI fun!
dragan.rocks·6d·
Discuss: Hacker News
🚀MLOps
Flag this post
Decision theory when you can't make decisions
lesswrong.com·11h
🎯Reinforcement Learning
Flag this post
Freewriting in my head, and overcoming the “twinge of starting”
lesswrong.com·1d
🗃️Zettelkasten
Flag this post
LLM Hallucinations: An Internal Tug of War
lesswrong.com·3d
🔍AI Interpretability
Flag this post
Ink without haven
lesswrong.com·1d
Writing
Flag this post
Secretly Loyal AIs: Threat Vectors and Mitigation Strategies
lesswrong.com·1d
🔢Homomorphic Encryption
Flag this post
2025 Unofficial LW Community Census, Request for Comments
lesswrong.com·4h
🌿Digital Gardens
Flag this post